Ahsan Ijaz
Bag of words model
Sample sentence with matrix:
Emphasizes important words
Appears rarely in corpus (rare globally)
\[ \textit{Inverse doc frequency} = \log\frac{\textit{#docs}}{1 + \textit{#docs using the word}} \]
Notion of closeness?
\[ \textit{distance}(x_i,x_q) = |x_i - x_q| \]
\[ \textit{distance}(x_i,x_q) = \sqrt{(a_1(x_i[1]-x_q[1])^2+\ldots+a_d(x_i[d]-x_q[d])^2))} \]
\[ \frac{\mathbf{x}_i^{T}\mathbf{x}_q}{\|x_i\|\|x_q\|} \]